Constrained Reinforcement Learning for Vehicle Motion Planning with Topological Reachability Analysis

نویسندگان

چکیده

Rule-based traditional motion planning methods usually perform well with prior knowledge of the macro-scale environments but encounter challenges in unknown and uncertain environments. Deep reinforcement learning (DRL) is a solution that can effectively deal micro-scale Nevertheless, DRL unstable lacks interpretability. Therefore, it raises new challenge: how to combine effectiveness overcome drawbacks two while guaranteeing stability In this study, multi-constraint multi-scale method proposed for automated driving use constrained (RL), named RLTT, comprising RL, topological reachability analysis used vehicle path space (TPS), trajectory lane model (TLM). First, dynamic vehicles formulated; then, TLM developed on basis model, thus constraining RL action state space. Second, achieved through TPS, range, discrete routing points are via RLTT. Third, designed by combining sophisticated rules, theoretical provided guarantee efficiency our method. Finally, related experiments conducted evaluate method; reduce 19.9% distance cost as compared Experimental results indicate help mitigate gap between data-driven methods, provide better performance driving, facilitate more fields.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Motion Planning for the ATHLETE Rover with Reinforcement Learning

Legged locomotion is attractive because it can enable a robot to traverse far more varied terrain than a wheeled rover is capable of. In the context of planetary exploration, this is especially attractive as the sites of greatest scientific interest tend to be characterized by difficult terrain. For example, it would be extremely difficult for a wheeled rover to make its way into a lunar crater...

متن کامل

Robust Reinforcement Learning in Motion Planning

While exploring to find better solutions, an agent performing online reinforcement learning (RL) can perform worse than is acceptable. In some cases, exploration might have unsafe, or even catastrophic, results, often modeled in terms of reaching 'failure' states of the agent's environment. This paper presents a method that uses domain knowledge to reduce the number of failures during explorati...

متن کامل

A Reinforcement Learning Approach for Motion Planning of Hopping Rovers

This paper considers the problem of navigating a hopping rover on the surface of small Solar System bodies, such as asteroids and comets. Specifically, by controlling the torques applied to three internal flywheels, the rover is able to perform controllable long-range hops and short tumbling maneuvers. The highly stochastic dynamics of bouncing in microgravity and the sequential and discrete na...

متن کامل

Curiosity driven reinforcement learning for motion planning on humanoids

Most previous work on artificial curiosity (AC) and intrinsic motivation focuses on basic concepts and theory. Experimental results are generally limited to toy scenarios, such as navigation in a simulated maze, or control of a simple mechanical system with one or two degrees of freedom. To study AC in a more realistic setting, we embody a curious agent in the complex iCub humanoid robot. Our n...

متن کامل

Safety-Constrained Reinforcement Learning for MDPs

We consider controller synthesis for stochastic and partially unknown environments in which safety is essential. Specifically, we abstract the problem as a Markov decision process in which the expected performance is measured using a cost function that is unknown prior to run-time exploration of the state space. Standard learning approaches synthesize cost-optimal strategies without guaranteein...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Robotics

سال: 2022

ISSN: ['2218-6581']

DOI: https://doi.org/10.3390/robotics11040081